A Corpus-Based Lexical Resource Of German Idioms

نویسندگان

  • Gerald Neumann
  • Christiane Fellbaum
  • Alexander Geyken
  • Axel Herold
  • Christiane Hummer
  • Fabian Korner
  • Undine Kramer
  • Kerstin Krell
  • Alexey Sokirko
  • Diana Stantcheva
  • Ekatherini Stathi
چکیده

In this paper, we present the design of a lexical resource focusing on German verb phrase idioms. The entry for a given idiom combines appropriate corpus examples with rich linguistic and lexicographic annotations, giving the user access to linguistic information coupled with the appropriate attested data and laying the groundwork for empirical, reproducible research.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adding nominal spice to SALSA - frame-semantic annotation of German nouns and verbs

This paper presents Release 2.0 of the SALSA corpus, a German resource for lexical semantics. The new corpus release provides new annotations for German nouns, complementing the existing annotations of German verbs in Release 1.0. The corpus now includes around 24,000 sentences with more than 36,000 annotated instances. It was designed with an eye towards NLP applications such as semantic role ...

متن کامل

Etymology, Contextual Pragmatic Clues, and Lexical Knowledge in L2 Idioms Learning

To investigate the effects of etymological elaboration, contextual pragmatic clues, and lexical knowledge on L2 idioms comprehension and production, 60 male intermediate level EFL students in three groups were selected. Each group was randomly assigned to one treatment condition. Group one participants were presented with the etymological explanation of idioms. In group two, the same idioms wer...

متن کامل

Cross-Lingual Induction for Deep Broad-Coverage Syntax: A Case Study on German Participles

This paper is a case study on cross-lingual induction of lexical resources for deep, broad-coverage syntactic analysis of German. We use a parallel corpus to induce a classifier for German participles which can predict their syntactic category. By means of this classifier, we induce a resource of adverbial participles from a huge monolingual corpus of German. We integrate the resource into a Ge...

متن کامل

Lexical Variability and Compositionality: Investigating Idiomaticity with Distributional Semantic Models

In this work we carried out an idiom type identification task on a set of 90 Italian V-NP and V-PP constructions comprising both idioms and non-idioms. Lexical variants were generated from these expressions by replacing their components with semantically related words extracted distributionally and from the Italian section of MultiWordNet. Idiomatic phrases turned out to be less similar to thei...

متن کامل

A Corpus-based Study of Lexical Bundles in Discussion Section of Medical Research Articles

There has been increasing interest in utilizing corpora in linguistic research and pedagogy in recent years. Rhetorical organization of different sections of research articles may appear similar in various disciplines, but close examination may show subtle differences nonetheless. One of the features that has been at the center of attention especially in recent years is the idiomaticity of a di...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004